Overview

Dataset statistics

Number of variables20
Number of observations338592
Missing cells0
Missing cells (%)0.0%
Duplicate rows319806
Duplicate rows (%)94.5%
Total size in memory51.7 MiB
Average record size in memory160.0 B

Variable types

Numeric17
Categorical3

Warnings

Dataset has 319806 (94.5%) duplicate rows Duplicates
Ba1Chakukaisu4 is highly skewed (γ1 = 20.61354341) Skewed
Ba1Chakukaisu3 has 336163 (99.3%) zeros Zeros
Ba1Chakukaisu4 has 336004 (99.2%) zeros Zeros
Ba1Chakukaisu6 has 320403 (94.6%) zeros Zeros
Ba2Chakukaisu1 has 247146 (73.0%) zeros Zeros
Ba2Chakukaisu2 has 251201 (74.2%) zeros Zeros
Ba2Chakukaisu3 has 246104 (72.7%) zeros Zeros
Ba2Chakukaisu4 has 243609 (71.9%) zeros Zeros
Ba2Chakukaisu5 has 242093 (71.5%) zeros Zeros
Ba2Chakukaisu6 has 83248 (24.6%) zeros Zeros
Ba3Chakukaisu1 has 284762 (84.1%) zeros Zeros
Ba3Chakukaisu2 has 288376 (85.2%) zeros Zeros
Ba3Chakukaisu3 has 286373 (84.6%) zeros Zeros
Ba3Chakukaisu4 has 287120 (84.8%) zeros Zeros
Ba3Chakukaisu5 has 286220 (84.5%) zeros Zeros
Ba3Chakukaisu6 has 148213 (43.8%) zeros Zeros
Ba5Chakukaisu1 has 240470 (71.0%) zeros Zeros

Reproduction

Analysis started2021-04-07 13:16:04.234919
Analysis finished2021-04-07 13:17:54.730485
Duration1 minute and 50.5 seconds
Software versionpandas-profiling v2.11.0
Download configurationconfig.yaml

Variables

ChuoChakukaisu6
Real number (ℝ≥0)

Distinct60
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10.69307603
Minimum0
Maximum88
Zeros2893
Zeros (%)0.9%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile2
Q15
median9
Q315
95-th percentile26
Maximum88
Range88
Interquartile range (IQR)10

Descriptive statistics

Standard deviation7.879045865
Coefficient of variation (CV)0.7368362332
Kurtosis2.943457118
Mean10.69307603
Median Absolute Deviation (MAD)5
Skewness1.39865532
Sum3620590
Variance62.07936374
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
527001
 
8.0%
425572
 
7.6%
323304
 
6.9%
623207
 
6.9%
719939
 
5.9%
818845
 
5.6%
916868
 
5.0%
216670
 
4.9%
1015503
 
4.6%
1114259
 
4.2%
Other values (50)137424
40.6%
ValueCountFrequency (%)
02893
 
0.9%
18742
 
2.6%
216670
4.9%
323304
6.9%
425572
7.6%
ValueCountFrequency (%)
8819
 
< 0.1%
692
 
< 0.1%
5811
 
< 0.1%
5778
< 0.1%
56152
< 0.1%

Ba1Chakukaisu1
Categorical

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.6 MiB
0
335908 
1
 
1989
2
 
477
3
 
212
4
 
6

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters338592
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
0335908
99.2%
11989
 
0.6%
2477
 
0.1%
3212
 
0.1%
46
 
< 0.1%
Histogram of lengths of the category
ValueCountFrequency (%)
0335908
99.2%
11989
 
0.6%
2477
 
0.1%
3212
 
0.1%
46
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
0335908
99.2%
11989
 
0.6%
2477
 
0.1%
3212
 
0.1%
46
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number338592
100.0%

Most frequent character per category

ValueCountFrequency (%)
0335908
99.2%
11989
 
0.6%
2477
 
0.1%
3212
 
0.1%
46
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Common338592
100.0%

Most frequent character per script

ValueCountFrequency (%)
0335908
99.2%
11989
 
0.6%
2477
 
0.1%
3212
 
0.1%
46
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII338592
100.0%

Most frequent character per block

ValueCountFrequency (%)
0335908
99.2%
11989
 
0.6%
2477
 
0.1%
3212
 
0.1%
46
 
< 0.1%

Ba1Chakukaisu2
Categorical

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.6 MiB
0
335995 
1
 
2028
2
 
350
3
 
213
4
 
6

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters338592
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
0335995
99.2%
12028
 
0.6%
2350
 
0.1%
3213
 
0.1%
46
 
< 0.1%
Histogram of lengths of the category
ValueCountFrequency (%)
0335995
99.2%
12028
 
0.6%
2350
 
0.1%
3213
 
0.1%
46
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
0335995
99.2%
12028
 
0.6%
2350
 
0.1%
3213
 
0.1%
46
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number338592
100.0%

Most frequent character per category

ValueCountFrequency (%)
0335995
99.2%
12028
 
0.6%
2350
 
0.1%
3213
 
0.1%
46
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Common338592
100.0%

Most frequent character per script

ValueCountFrequency (%)
0335995
99.2%
12028
 
0.6%
2350
 
0.1%
3213
 
0.1%
46
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII338592
100.0%

Most frequent character per block

ValueCountFrequency (%)
0335995
99.2%
12028
 
0.6%
2350
 
0.1%
3213
 
0.1%
46
 
< 0.1%

Ba1Chakukaisu3
Real number (ℝ≥0)

ZEROS

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.009388881013
Minimum0
Maximum5
Zeros336163
Zeros (%)99.3%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum5
Range5
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.1251731673
Coefficient of variation (CV)13.3320645
Kurtosis428.4542935
Mean0.009388881013
Median Absolute Deviation (MAD)0
Skewness18.16900931
Sum3179
Variance0.0156683218
MonotocityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0336163
99.3%
11917
 
0.6%
2354
 
0.1%
388
 
< 0.1%
460
 
< 0.1%
510
 
< 0.1%
ValueCountFrequency (%)
0336163
99.3%
11917
 
0.6%
2354
 
0.1%
388
 
< 0.1%
460
 
< 0.1%
ValueCountFrequency (%)
510
 
< 0.1%
460
 
< 0.1%
388
 
< 0.1%
2354
 
0.1%
11917
0.6%

Ba1Chakukaisu4
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.009176235706
Minimum0
Maximum6
Zeros336004
Zeros (%)99.2%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum6
Range6
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.1190707404
Coefficient of variation (CV)12.97598974
Kurtosis671.9708479
Mean0.009176235706
Median Absolute Deviation (MAD)0
Skewness20.61354341
Sum3107
Variance0.01417784122
MonotocityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0336004
99.2%
12246
 
0.7%
2255
 
0.1%
347
 
< 0.1%
625
 
< 0.1%
415
 
< 0.1%
ValueCountFrequency (%)
0336004
99.2%
12246
 
0.7%
2255
 
0.1%
347
 
< 0.1%
415
 
< 0.1%
ValueCountFrequency (%)
625
 
< 0.1%
415
 
< 0.1%
347
 
< 0.1%
2255
 
0.1%
12246
0.7%

Ba1Chakukaisu5
Categorical

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.6 MiB
0
336030 
1
 
2191
2
 
265
4
 
64
3
 
42

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters338592
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
0336030
99.2%
12191
 
0.6%
2265
 
0.1%
464
 
< 0.1%
342
 
< 0.1%
Histogram of lengths of the category
ValueCountFrequency (%)
0336030
99.2%
12191
 
0.6%
2265
 
0.1%
464
 
< 0.1%
342
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
0336030
99.2%
12191
 
0.6%
2265
 
0.1%
464
 
< 0.1%
342
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number338592
100.0%

Most frequent character per category

ValueCountFrequency (%)
0336030
99.2%
12191
 
0.6%
2265
 
0.1%
464
 
< 0.1%
342
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Common338592
100.0%

Most frequent character per script

ValueCountFrequency (%)
0336030
99.2%
12191
 
0.6%
2265
 
0.1%
464
 
< 0.1%
342
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII338592
100.0%

Most frequent character per block

ValueCountFrequency (%)
0336030
99.2%
12191
 
0.6%
2265
 
0.1%
464
 
< 0.1%
342
 
< 0.1%

Ba1Chakukaisu6
Real number (ℝ≥0)

ZEROS

Distinct11
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.08332447311
Minimum0
Maximum10
Zeros320403
Zeros (%)94.6%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum10
Range10
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.4518584196
Coefficient of variation (CV)5.42287761
Kurtosis135.535806
Mean0.08332447311
Median Absolute Deviation (MAD)0
Skewness9.652991954
Sum28213
Variance0.2041760314
MonotocityNot monotonic
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
0320403
94.6%
113021
 
3.8%
23062
 
0.9%
31085
 
0.3%
4350
 
0.1%
5222
 
0.1%
6173
 
0.1%
7137
 
< 0.1%
1082
 
< 0.1%
930
 
< 0.1%
ValueCountFrequency (%)
0320403
94.6%
113021
 
3.8%
23062
 
0.9%
31085
 
0.3%
4350
 
0.1%
ValueCountFrequency (%)
1082
< 0.1%
930
 
< 0.1%
827
 
< 0.1%
7137
< 0.1%
6173
0.1%

Ba2Chakukaisu1
Real number (ℝ≥0)

ZEROS

Distinct11
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.5687257821
Minimum0
Maximum12
Zeros247146
Zeros (%)73.0%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile3
Maximum12
Range12
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.166710687
Coefficient of variation (CV)2.051446803
Kurtosis7.05209127
Mean0.5687257821
Median Absolute Deviation (MAD)0
Skewness2.522794802
Sum192566
Variance1.361213827
MonotocityNot monotonic
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
0247146
73.0%
141328
 
12.2%
221811
 
6.4%
314498
 
4.3%
48292
 
2.4%
53463
 
1.0%
61141
 
0.3%
7603
 
0.2%
8239
 
0.1%
964
 
< 0.1%
ValueCountFrequency (%)
0247146
73.0%
141328
 
12.2%
221811
 
6.4%
314498
 
4.3%
48292
 
2.4%
ValueCountFrequency (%)
127
 
< 0.1%
964
 
< 0.1%
8239
 
0.1%
7603
0.2%
61141
0.3%

Ba2Chakukaisu2
Real number (ℝ≥0)

ZEROS

Distinct12
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.5526149466
Minimum0
Maximum11
Zeros251201
Zeros (%)74.2%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile3
Maximum11
Range11
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.216826786
Coefficient of variation (CV)2.201943312
Kurtosis11.59694356
Mean0.5526149466
Median Absolute Deviation (MAD)0
Skewness3.046659076
Sum187111
Variance1.480667427
MonotocityNot monotonic
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
0251201
74.2%
140823
 
12.1%
221593
 
6.4%
311125
 
3.3%
46504
 
1.9%
53730
 
1.1%
61946
 
0.6%
7860
 
0.3%
8331
 
0.1%
10192
 
0.1%
Other values (2)287
 
0.1%
ValueCountFrequency (%)
0251201
74.2%
140823
 
12.1%
221593
 
6.4%
311125
 
3.3%
46504
 
1.9%
ValueCountFrequency (%)
11107
 
< 0.1%
10192
 
0.1%
9180
 
0.1%
8331
 
0.1%
7860
0.3%

Ba2Chakukaisu3
Real number (ℝ≥0)

ZEROS

Distinct12
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.5350244542
Minimum0
Maximum11
Zeros246104
Zeros (%)72.7%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile3
Maximum11
Range11
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.122112988
Coefficient of variation (CV)2.097311589
Kurtosis10.71163554
Mean0.5350244542
Median Absolute Deviation (MAD)0
Skewness2.902753687
Sum181155
Variance1.259137558
MonotocityNot monotonic
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
0246104
72.7%
147427
 
14.0%
222414
 
6.6%
311490
 
3.4%
45817
 
1.7%
52921
 
0.9%
61296
 
0.4%
7602
 
0.2%
8265
 
0.1%
9162
 
< 0.1%
Other values (2)94
 
< 0.1%
ValueCountFrequency (%)
0246104
72.7%
147427
 
14.0%
222414
 
6.6%
311490
 
3.4%
45817
 
1.7%
ValueCountFrequency (%)
1149
 
< 0.1%
1045
 
< 0.1%
9162
 
< 0.1%
8265
0.1%
7602
0.2%

Ba2Chakukaisu4
Real number (ℝ≥0)

ZEROS

Distinct12
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.5219172337
Minimum0
Maximum11
Zeros243609
Zeros (%)71.9%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile3
Maximum11
Range11
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.069501
Coefficient of variation (CV)2.049177399
Kurtosis10.48584399
Mean0.5219172337
Median Absolute Deviation (MAD)0
Skewness2.849407061
Sum176717
Variance1.143832388
MonotocityNot monotonic
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
0243609
71.9%
151409
 
15.2%
223123
 
6.8%
310807
 
3.2%
44877
 
1.4%
52773
 
0.8%
61362
 
0.4%
7265
 
0.1%
8177
 
0.1%
9116
 
< 0.1%
Other values (2)74
 
< 0.1%
ValueCountFrequency (%)
0243609
71.9%
151409
 
15.2%
223123
 
6.8%
310807
 
3.2%
44877
 
1.4%
ValueCountFrequency (%)
1141
 
< 0.1%
1033
 
< 0.1%
9116
< 0.1%
8177
0.1%
7265
0.1%

Ba2Chakukaisu5
Real number (ℝ≥0)

ZEROS

Distinct10
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.5037537804
Minimum0
Maximum9
Zeros242093
Zeros (%)71.5%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile3
Maximum9
Range9
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.015864293
Coefficient of variation (CV)2.016588923
Kurtosis10.21262323
Mean0.5037537804
Median Absolute Deviation (MAD)0
Skewness2.815499258
Sum170567
Variance1.031980262
MonotocityNot monotonic
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
0242093
71.5%
155405
 
16.4%
222790
 
6.7%
310014
 
3.0%
44515
 
1.3%
52344
 
0.7%
6646
 
0.2%
7524
 
0.2%
8133
 
< 0.1%
9128
 
< 0.1%
ValueCountFrequency (%)
0242093
71.5%
155405
 
16.4%
222790
 
6.7%
310014
 
3.0%
44515
 
1.3%
ValueCountFrequency (%)
9128
 
< 0.1%
8133
 
< 0.1%
7524
 
0.2%
6646
 
0.2%
52344
0.7%

Ba2Chakukaisu6
Real number (ℝ≥0)

ZEROS

Distinct39
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.558749173
Minimum0
Maximum46
Zeros83248
Zeros (%)24.6%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q35
95-th percentile13
Maximum46
Range46
Interquartile range (IQR)4

Descriptive statistics

Standard deviation4.63427313
Coefficient of variation (CV)1.302219658
Kurtosis8.044551369
Mean3.558749173
Median Absolute Deviation (MAD)2
Skewness2.398651893
Sum1204964
Variance21.47648745
MonotocityNot monotonic
Histogram with fixed size bins (bins=39)
ValueCountFrequency (%)
083248
24.6%
166405
19.6%
245473
13.4%
332165
 
9.5%
421907
 
6.5%
516458
 
4.9%
612543
 
3.7%
710717
 
3.2%
88398
 
2.5%
96904
 
2.0%
Other values (29)34374
10.2%
ValueCountFrequency (%)
083248
24.6%
166405
19.6%
245473
13.4%
332165
 
9.5%
421907
 
6.5%
ValueCountFrequency (%)
4646
 
< 0.1%
4111
 
< 0.1%
38133
< 0.1%
3794
< 0.1%
361
 
< 0.1%

Ba3Chakukaisu1
Real number (ℝ≥0)

ZEROS

Distinct9
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.2451209716
Minimum0
Maximum8
Zeros284762
Zeros (%)84.1%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile2
Maximum8
Range8
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.6627322956
Coefficient of variation (CV)2.703694798
Kurtosis14.85048482
Mean0.2451209716
Median Absolute Deviation (MAD)0
Skewness3.47756838
Sum82996
Variance0.4392140956
MonotocityNot monotonic
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
0284762
84.1%
134539
 
10.2%
212299
 
3.6%
34922
 
1.5%
41492
 
0.4%
5400
 
0.1%
6123
 
< 0.1%
753
 
< 0.1%
82
 
< 0.1%
ValueCountFrequency (%)
0284762
84.1%
134539
 
10.2%
212299
 
3.6%
34922
 
1.5%
41492
 
0.4%
ValueCountFrequency (%)
82
 
< 0.1%
753
 
< 0.1%
6123
 
< 0.1%
5400
 
0.1%
41492
0.4%

Ba3Chakukaisu2
Real number (ℝ≥0)

ZEROS

Distinct8
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.2377374539
Minimum0
Maximum8
Zeros288376
Zeros (%)85.2%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile2
Maximum8
Range8
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.6857793531
Coefficient of variation (CV)2.884607965
Kurtosis20.97063515
Mean0.2377374539
Median Absolute Deviation (MAD)0
Skewness3.989717459
Sum80496
Variance0.4702933212
MonotocityNot monotonic
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
0288376
85.2%
131483
 
9.3%
211593
 
3.4%
34453
 
1.3%
41589
 
0.5%
5718
 
0.2%
6259
 
0.1%
8121
 
< 0.1%
ValueCountFrequency (%)
0288376
85.2%
131483
 
9.3%
211593
 
3.4%
34453
 
1.3%
41589
 
0.5%
ValueCountFrequency (%)
8121
 
< 0.1%
6259
 
0.1%
5718
 
0.2%
41589
 
0.5%
34453
1.3%

Ba3Chakukaisu3
Real number (ℝ≥0)

ZEROS

Distinct8
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.231222238
Minimum0
Maximum7
Zeros286373
Zeros (%)84.6%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile2
Maximum7
Range7
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.6305127553
Coefficient of variation (CV)2.726869011
Kurtosis14.0780224
Mean0.231222238
Median Absolute Deviation (MAD)0
Skewness3.438231528
Sum78290
Variance0.3975463346
MonotocityNot monotonic
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
0286373
84.6%
134430
 
10.2%
211818
 
3.5%
34112
 
1.2%
41496
 
0.4%
5278
 
0.1%
681
 
< 0.1%
74
 
< 0.1%
ValueCountFrequency (%)
0286373
84.6%
134430
 
10.2%
211818
 
3.5%
34112
 
1.2%
41496
 
0.4%
ValueCountFrequency (%)
74
 
< 0.1%
681
 
< 0.1%
5278
 
0.1%
41496
 
0.4%
34112
1.2%

Ba3Chakukaisu4
Real number (ℝ≥0)

ZEROS

Distinct9
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.2223088555
Minimum0
Maximum8
Zeros287120
Zeros (%)84.8%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum8
Range8
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.6211374209
Coefficient of variation (CV)2.794029142
Kurtosis19.90715047
Mean0.2223088555
Median Absolute Deviation (MAD)0
Skewness3.844529266
Sum75272
Variance0.3858116956
MonotocityNot monotonic
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
0287120
84.8%
135750
 
10.6%
210360
 
3.1%
33634
 
1.1%
41048
 
0.3%
5481
 
0.1%
6138
 
< 0.1%
848
 
< 0.1%
713
 
< 0.1%
ValueCountFrequency (%)
0287120
84.8%
135750
 
10.6%
210360
 
3.1%
33634
 
1.1%
41048
 
0.3%
ValueCountFrequency (%)
848
 
< 0.1%
713
 
< 0.1%
6138
 
< 0.1%
5481
0.1%
41048
0.3%

Ba3Chakukaisu5
Real number (ℝ≥0)

ZEROS

Distinct8
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.2153270012
Minimum0
Maximum7
Zeros286220
Zeros (%)84.5%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum7
Range7
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.5880329796
Coefficient of variation (CV)2.730883615
Kurtosis18.77318828
Mean0.2153270012
Median Absolute Deviation (MAD)0
Skewness3.723778858
Sum72908
Variance0.3457827851
MonotocityNot monotonic
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
0286220
84.5%
138188
 
11.3%
29808
 
2.9%
33210
 
0.9%
4585
 
0.2%
5384
 
0.1%
6165
 
< 0.1%
732
 
< 0.1%
ValueCountFrequency (%)
0286220
84.5%
138188
 
11.3%
29808
 
2.9%
33210
 
0.9%
4585
 
0.2%
ValueCountFrequency (%)
732
 
< 0.1%
6165
 
< 0.1%
5384
 
0.1%
4585
 
0.2%
33210
0.9%

Ba3Chakukaisu6
Real number (ℝ≥0)

ZEROS

Distinct27
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.659918132
Minimum0
Maximum34
Zeros148213
Zeros (%)43.8%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q32
95-th percentile7
Maximum34
Range34
Interquartile range (IQR)2

Descriptive statistics

Standard deviation2.506906864
Coefficient of variation (CV)1.510259342
Kurtosis12.20327697
Mean1.659918132
Median Absolute Deviation (MAD)1
Skewness2.791941412
Sum562035
Variance6.284582026
MonotocityNot monotonic
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
0148213
43.8%
172819
21.5%
240146
 
11.9%
324994
 
7.4%
416563
 
4.9%
510815
 
3.2%
67516
 
2.2%
75028
 
1.5%
83356
 
1.0%
92633
 
0.8%
Other values (17)6509
 
1.9%
ValueCountFrequency (%)
0148213
43.8%
172819
21.5%
240146
 
11.9%
324994
 
7.4%
416563
 
4.9%
ValueCountFrequency (%)
3419
 
< 0.1%
2852
< 0.1%
2415
 
< 0.1%
2331
< 0.1%
2245
< 0.1%

Ba5Chakukaisu1
Real number (ℝ≥0)

ZEROS

Distinct10
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.5399832246
Minimum0
Maximum9
Zeros240470
Zeros (%)71.0%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile3
Maximum9
Range9
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.051818474
Coefficient of variation (CV)1.947872501
Kurtosis6.93250647
Mean0.5399832246
Median Absolute Deviation (MAD)0
Skewness2.448855409
Sum182834
Variance1.106322103
MonotocityNot monotonic
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
0240470
71.0%
151261
 
15.1%
223694
 
7.0%
313714
 
4.1%
46090
 
1.8%
52170
 
0.6%
6709
 
0.2%
7323
 
0.1%
8131
 
< 0.1%
930
 
< 0.1%
ValueCountFrequency (%)
0240470
71.0%
151261
 
15.1%
223694
 
7.0%
313714
 
4.1%
46090
 
1.8%
ValueCountFrequency (%)
930
 
< 0.1%
8131
 
< 0.1%
7323
 
0.1%
6709
 
0.2%
52170
0.6%

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

ChuoChakukaisu6Ba1Chakukaisu1Ba1Chakukaisu2Ba1Chakukaisu3Ba1Chakukaisu4Ba1Chakukaisu5Ba1Chakukaisu6Ba2Chakukaisu1Ba2Chakukaisu2Ba2Chakukaisu3Ba2Chakukaisu4Ba2Chakukaisu5Ba2Chakukaisu6Ba3Chakukaisu1Ba3Chakukaisu2Ba3Chakukaisu3Ba3Chakukaisu4Ba3Chakukaisu5Ba3Chakukaisu6Ba5Chakukaisu1
080000002300361001010
180000002300361001010
280000002300361001010
380000002300361001010
490000003142260201020
590000003142260201020
690000003142260201020
790000003142260201020
890000003142260201020
990000003142260201020

Last rows

ChuoChakukaisu6Ba1Chakukaisu1Ba1Chakukaisu2Ba1Chakukaisu3Ba1Chakukaisu4Ba1Chakukaisu5Ba1Chakukaisu6Ba2Chakukaisu1Ba2Chakukaisu2Ba2Chakukaisu3Ba2Chakukaisu4Ba2Chakukaisu5Ba2Chakukaisu6Ba3Chakukaisu1Ba3Chakukaisu2Ba3Chakukaisu3Ba3Chakukaisu4Ba3Chakukaisu5Ba3Chakukaisu6Ba5Chakukaisu1
33858200000000010000000000
33858310000000000010000000
33858430000000100020000000
33858510000000000010000000
33858630000000000000000000
33858720000000000020000000
338588120000004123090110020
33858920000000000020000000
33859050000000000030000010
33859110000000000010000000

Duplicate rows

Most frequent

ChuoChakukaisu6Ba1Chakukaisu1Ba1Chakukaisu2Ba1Chakukaisu3Ba1Chakukaisu4Ba1Chakukaisu5Ba1Chakukaisu6Ba2Chakukaisu1Ba2Chakukaisu2Ba2Chakukaisu3Ba2Chakukaisu4Ba2Chakukaisu5Ba2Chakukaisu6Ba3Chakukaisu1Ba3Chakukaisu2Ba3Chakukaisu3Ba3Chakukaisu4Ba3Chakukaisu5Ba3Chakukaisu6Ba5Chakukaisu1count
1642300000000000000000005048
871200000000000000000004648
2557400000000000000000004285
3606500000000000000000003389
1688300000000000100000002661
322100000000000000000002564
2600400000000000100000002505
3657500000000000100000002182
917200000000000100000002132
4764600000000000000000002071